(Bilenko; Mooney, 2005) There is demand for this type of software, and text mining is becoming important across a wide area of information analysis. One such field is the analysis of literature and research reviews.
Literary and Scientific Demands:
Demand for text mining is growing in literature review and library services, and extensive research has gone into creating algorithms for book-based text mining. Ananiadou et al. (2009) used text mining solutions in creating literature reviews. The text mining framework for systematic reviews that they created, together with what they called the 'service exemplar', served as a test bed for deriving the possible requirements for text mining tools for literature services. Thus the use of text mining can enhance literature reviews and also create a new stream of literary analysis. (Ananiadou, Sophia; et al., 2009)
In another study, of news and the internet, Montes et al. (1999) established that text mining techniques are effective in the analysis of internet and newspaper news. They focused on current topics of opinion drawn from Spanish examples. They used a classical statistical model based on average calculation, distribution analysis, and standard deviation. The results showed society's interests and their changing nature, and the researchers could pinpoint the change points.
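The kind of statistical check described above can be illustrated with a minimal sketch. It is not the model used by Montes et al. (1999); it only assumes that a topic's mentions are counted per period and that a period counts as a change point when its count lies more than a chosen number of standard deviations from the average of the earlier periods. The function name, the threshold, and the sample counts are all hypothetical.

```python
from statistics import mean, stdev


def flag_change_points(counts, threshold=2.0):
    """Return the indices of periods whose count deviates from the mean of
    all earlier periods by more than `threshold` standard deviations.

    Illustrative only: `threshold` is an assumed cut-off, not a published value.
    """
    flagged = []
    # At least two earlier periods are needed to compute a standard deviation.
    for i in range(2, len(counts)):
        history = counts[:i]
        mu = mean(history)
        sigma = stdev(history)
        if sigma == 0:
            # A perfectly flat history: any different value is a change.
            if counts[i] != mu:
                flagged.append(i)
        elif abs(counts[i] - mu) / sigma > threshold:
            flagged.append(i)
    return flagged


# Hypothetical weekly mention counts for a single news topic.
weekly_mentions = [12, 14, 13, 14, 12, 40]
change_points = flag_change_points(weekly_mentions)
print(change_points)
```

With these sample counts only the final week is flagged, because it is the only one far outside the spread of the weeks before it.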
Text mining has likewise been effective in medical research, and the fact that the method works in so different a field shows how significant it is. For example, Natarajan et al. (2006) compared the expression profiles for the same cell lines under the influence of epidermal growth factor (EGF), an important growth factor. They reported a set of 72 genes that are significantly differentially expressed as a unique response to S1P. "Based on the result of mining full-text articles from 20 scientific journals in the field of cancer research published over a period of five years," Natarajan et al. (2006) said they found a gene-to-gene interaction network for seventy-two genes. Thus the researchers say that the "automated extraction of information from biological literature will prompt the progress of the discoveries in biological knowledge." (Natarajan, et al., 2006) The other uses are commercial and business oriented, and also include the analysis of behemoths like the internet.
Uses and Advantages:
Text files hold over eighty percent of any business's information, and that information is the most difficult to find or use, so businesses find the prospect of text mining attractive. Companies are increasingly using the new generation of text mining tools to discover relationships and to summarize information. One such tool is 'ClearResearch' from ClearForest Corporation, which uses pattern matching and shows the relationships it finds as a graph. Though not as accurate as the established data mining tools, text mining tools are basically effective. (Robb, 2004)
Other software on the market includes Wordstat, developed by Provalis Research, and SAS Textminer, from SAS; both have established a presence. Both packages were found to have flaws as well as benefits, and both have features that researchers can use to find associations (Davi; Haughton; Nasr; Shah; Skaletsky; Spack, 2005), but they are not helpful in extracting themes from unstructured data. As of now, the available software searches for specific terms or categorizes documents based on those terms. This is not satisfactory, because the same term may mean different things to different people, and so the analysis-based approach to text mining is not yet complete. Text mining can also be used to review a product on the market by analyzing the reviews obtained through surveys. Because such reviews are unstructured data, mining them helps to identify facts about product features, gauge public opinion on the product, find the polarity of opinions, and rank opinions, none of which would otherwise be possible. (Kao; Poteet, 2007)
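A minimal sketch can make the term-based approach to review mining concrete. It is not drawn from Kao and Poteet (2007) or from any of the packages named above; the word lists, feature names, and function names are hypothetical, and the polarity score is simply the count of positive terms minus the count of negative terms.

```python
import re

# Hypothetical toy lexicons; a real system would use far larger resources.
POSITIVE = {"good", "great", "excellent", "reliable", "love"}
NEGATIVE = {"bad", "poor", "terrible", "slow", "broken"}
FEATURES = {"battery", "screen", "price"}


def tokenize(text):
    """Lower-case the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def score_review(text):
    """Return the product features mentioned and a crude polarity score."""
    tokens = tokenize(text)
    positive_hits = sum(token in POSITIVE for token in tokens)
    negative_hits = sum(token in NEGATIVE for token in tokens)
    return {
        "features": sorted(set(tokens) & FEATURES),
        "polarity": positive_hits - negative_hits,
    }


def rank_reviews(reviews):
    """Order reviews from most positive to most negative."""
    return sorted(reviews, key=lambda r: score_review(r)["polarity"], reverse=True)


reviews = [
    "The battery is great but the screen is slow and broken",
    "Excellent price and a reliable battery",
]
for review in rank_reviews(reviews):
    print(score_review(review), review)
```

The sketch also shows the limitation noted above: it matches fixed terms only, so a word that carries a different meaning for a different reviewer is still scored the same way.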
Though this is the general need, there are obstacles to the diffusion of text mining. One is that no conclusive research has shown that a particular method has been largely successful....